MC Prediction

L605 MC Prediction Part 2 RENDER V3

## Quiz

To check your understanding of the video, please answer the question below.

SOLUTION:

If the agent follows a policy for many episodes, we can use the results to directly estimate the action-value function corresponding to the same policy.
The Q-table is used to estimate the action-value function.